Amendments to the Claims: 

Re-write the claims as set forth below: This listing of claims will replace all prior versions 
and listings, of claims in the application. 

Listing of Claims: 

1. (Currently amended) A multi-thread accumulation circuit that supports a 
plurality of threads, comprising: 

a first operation unit operably coupled to receive a first operand and a second operand 
corresponding to an operation code issued by a selected thread of the plurality of threads, 
wherein the operation unit combines the first and second operands to produce a first operation 
result corresponding to the selected thread; 

a plurality of accumulation registers operably coupled to the first operation unit, 
wherein each accumulation register of the plurality of accumulation registers corresponds to 
one of the plurality of threads, wherein a selected accumulation register that corresponds to 
the selected thread stores the first operation result corresponding to the selected thread; 
[[and]] 

a selection block operably coupled to the plurality of accumulation registers and the 
first operation unit, wherein the selection block selects the second operand provided to the 
first operation unit from a set of potential operands, wherein the set of potential operands 
includes contents of each accumulation register of the plurality of accumulation registers; 

a control block operably coupled to the selection block and the plurality of 
accumulation registers, wherein the control block receives information based on the operation 
code and generates control information provided to the plurality of accumulation registers 
and the selection block, wherein the control information provided to the plurality of 
accumulation registers causes the selected accumulation register to store the result 
corresponding to the selected thread when the operation code corresponds to an accumulate 
operation; 

2 

CHICAGO/#l 4 18609.2 



a second operation unit operably coupled to the first operation unit, wherein the 
second operation unit is operablv coupled to receive a third operand and a fourth operand, 
wherein the second operation unit combines the third and fourth operands to produce a 
second operation result, wherein the second operation result is provided to the first operation 
unit as the first operand; and 

an arbitration module operablv coupled to the control block and the second operation 
unit, wherein the arbitration module receives operation codes from a plurality of thread 
controllers corresponding to the plurality of threads, and wherein the arbitration module 
determines order of execution of the operation codes received based on an application 
specific prioritization scheme . 

2. (Canceled) 

3. (Currently amended) The multi-thread accumulation circuit of claim [[2]]I, 
wherein when the operation code corresponds to an accumulate operation, the control block 
provides the control information to the selection block such that the selection block selects a 
current value stored in the selected accumulation register as the second operand. 

4. (Currently amended) The multi-thread accumulation circuit of claim 3, 
wherein the first operation unit performs an addition operation such that the result of an 
accumulate operation is a sum of the current value stored in the selected accumulation 
register and the first operand. 

5. (Canceled) 
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6. (Currently amended) The multi-thread accumulation circuit of claim [[5]]I, 
wherein the second operation unit performs multiplication operations such that a plurality of 
multiply and accumulate functions are supported for the plurality of threads by the multi- 
thread accumulation circuit. 

7. (Canceled) 

8. (Currently amended) The multi-thread accumulation circuit of claim [[7]]I> 
wherein the multi-thread accumulation circuit is included in a vector engine that performs at 
least one of dot product operations, vector multiply accumulate operations, vector addition 
operations, and vector multiplication operations. 

9. (Currently amended) The multi-thread accumulation credit of claim [[2]]l 
further comprises a memory operably coupled to the selection block, the first operation unit, 
and the control block, wherein the memory stores the first operation result produced by the 
first operation unit, wherein contents of the memory are selectively included in the set of 
potential operands based on a portion of the control information generated by the control 
block. 

10. (Original) The multi -thread accumulation circuit of claim 1, wherein at least a 
portion of the plurality of accumulation registers include a first register section and a second 
register section, wherein the first register section is used for accumulation operations 
corresponding to a first set of operation codes and the second section is used for 
accumulation operations corresponding to a second set of operation codes. 
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1 1 . (Original) The multi-thread accumulation circuit of claim 10, wherein the first 
register section accumulates diffuse color information corresponding to graphics primitives, 
and wherein the second register section accumulates specular color information 
corresponding to the graphics primitives. 

12. (Currently amended) A method for performing a plurality of combine and 
accumulate operations in a multi-thread system that supports a plurality of threads, 
comprising: 

using an arbitration module to receive operation codes from a plurality of thread 
controllers corresponding to the plurality of threads and to determine the order of execution 
of the operation codes received based on an application specific prioritization scheme; 
after determining the order of execution of the operation codes: 

receiving a first set of operands corresponding to a selected thread of the 
plurality of threads, wherein the first set of operands corresponds to a first 
accumulation operation for the selected thread; 

combining the first set of operands to produce a first result; 

selecting a selected accumulation register from a plurality of accumulation 
registers based on identity of the selected thread, wherein each accumulation register 
of the plurality of accumulation registers corresponds to one of the plurality of 
threads; 

storing the first result in the selected accumulation register to produce a first 
accumulated value; 

receiving a second set of operands corresponding to the selected thread, 
wherein the second set of operands corresponds to a second accumulation operation 
for the selected thread; 
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combining the second set of operands to produce a second result; 

combining the second result with the first accumulated value to produce a 
second accumulated value; and 

storing the second accumulated value in the selected register to produce a 
second accumulated result. 

13. (Original) The method of claim 12, wherein combining the first set of 
operands includes combining the first set of operands using a multiplication operation, and 
wherein the combining the second set of operations further comprises combining the second 
set of operands using a multiplication operation. 

14. (Original) The method of claim 13, wherein combining the second result with 
the first accumulated value further comprises combining the second result with the first 
accumulated value using an addition operation such that a multiply and accumulate operation 
for the first and second sets of operands is achieved. 

15. (Original) The method of claim 12 further comprises: 

receiving subsequent sets of operands corresponding to the selected thread 
corresponding to subsequent accumulation operations for the selected thread; 
for each subsequent set of operands: 

combining the subsequent set of operands to produce a subsequent result; 

combining the subsequent result with a current value stored in the selected 
accumulation register to produce a subsequent accumulated result; and 

storing the subsequent accumulated result in the selected accumulation register such 
that the current value stored in the selected accumulation register is updated. 
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16. (Previously presented) The method of claim 12 further comprises performing 
combination operations corresponding to at least one additional thread of the plurality of 
threads subsequent to combining the first set of operands and prior to combining the second 
set of operands. 

17. (Currently amended) A multi-thread multiply and accumulate circuit, 
comprising: 

an arbitration module that receives command codes corresponding to a plurality of 
threads, wherein at least a portion of the command codes correspond to multiply and 
accumulate operations, wherein the arbitration module determines an order of execution of 
the command codes based on an application specific prioritization scheme ; 

a multiplier operably coupled to the arbitration module, wherein the multiplier 
combines a set of operands corresponding to each command code being executed to produce 
a product corresponding to a selected thread from which the command code being executed 
originated; 

an adder operably coupled to the multiplier, wherein the adder combines the product 
of the multiplier with a second operand that is received to produce a sum corresponding to 
selected thread; 

a plurality of accumulation registers operably coupled to the adder, wherein each of 
the plurality of accumulation registers corresponds to one of the plurality of threads, wherein 
a selected accumulation register that corresponds to the selected thread stores the sum 
corresponding to the selected thread; and 

a selection block operably coupled to the plurality of accumulation registers and the 
adder, wherein the selection block selects the second operand from a set of potential operands 
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based on control information derived from the command code being executed, wherein the 
set of potential operands includes values stored in each of the plurality of accumulation 
registers. 

18. (Original) The circuit of claim 17, wherein the set of potential operands 
includes at least one additional operand, wherein the at least one additional operand is at least 
one of a constant, a state variable, and data stored in a memory structure as a result of 
previous operations performed by the circuit. 

19. (Original) The circuit of claim 1 7, wherein at least a portion of the plurality of 
accumulation registers include a first register section and a second register section, wherein 
the first register section is used for accumulation operations corresponding to a first set of 
operation codes and the second section is used for accumulation operations corresponding to 
a second set of operation codes. 

20. (Original) The circuit of claim 19, wherein the first register section 
accumulates diffuse color information corresponding to graphics primitives, and wherein the 
second register section accumulates specular color information corresponding to the graphics 
primitives. 

21. (Currently amended) A multi-thread accumulation circuit that supports a 
plurality of threads, comprising: 

a first operation unit operably coupled to receive a first operand and a second operand 
corresponding to an operation code issued by a selected thread of the plurality of threads, 
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wherein the operation unit combines the first and second operands to produce a first operation 
result corresponding to the selected thread; 

a plurality of accumulation registers operably coupled to the first operation unit, 
wherein each accumulation register of the plurality of accumulation registers corresponds to 
one of the plurality of threads, wherein a selected accumulation register that corresponds to 
the selected thread stores the first operation result corresponding to the selected thread; 
[[and]] 

a selection block operably coupled to the plurality of accumulation registers and the 
first operation unit, wherein the selection block selects the second operand provided to the 
first operation unit from a set of potential operands, wherein the set of potential operands 
includes contents of each accumulation register of the plurality of accumulation registers, 
wherein when the operation code is dependent on the results of a previously issued operation 
code, the selection block will not release the dependent operation code until a predetermined 
amount of time has passed corresponding to the latency associated with executing the 
previously issued operation code; 

a control block operably coupled to the selection block and the plurality of 
accumulation registers, wherein the control block receives information based on the operation 
code and generates control information provided to the plurality of accumulation registers 
and the selection block, wherein the control information provided to the plurality of 
accumulation registers causes the selected accumulation register to store the result 
corresponding to the selected thread when the operation code corresponds to an accumulate 
operation; 

a second operation unit operably coupled to the first operation unit wherein the 
second operation unit is operably coupled to receive a third operand and a fourth operand, 
wherein the second operation unit combines the third and fourth operands to produce a 
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second operation result, wherein the second operation result is provided to the first operation 
unit as the first operand; and 

an arbitration module operablv coupled to the control block and the second operation 
unit, wherein the arbitration module receives operation codes from a plurality of thread 
controllers corresponding to the plurality of threads, and wherein the arbitration module 
determines order of execution of the operation codes received based on an application 
specific prioritization scheme. 

22. (Previously presented) The multi-thread accumulation circuit of claim 21 
further comprises a control block operably coupled to the selection block and the plurality of 
accumulation registers, wherein the control block receives information based on the operation 
code and generates control information provided to the plurality of accumulation registers 
and the selection block, wherein the control information provided to the plurality of 
accumulation registers causes the selected accumulation register to store the result 
corresponding to the selected thread when the operation code corresponds to an accumulate 
operation. 

23. (Currently amended) A method for performing a plurality of combine and 
accumulate operations in a multi-thread system that supports a plurality of threads, 
comprising: 

using an arbitration module to receive operation codes from a plurality of thread 
controllers corresponding to the plurality of threads and to determine the order of execution 
of the operation codes received based on an application specific prioritization scheme; 

after determining the order of execution of the operation codes: 
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receiving a first set of operands corresponding to a selected thread of the 
plurality of threads, wherein the first set of operands corresponds to a first 
accumulation operation for the selected thread; 

combining the first set of operands using a multiplication operation to produce 
a first result; 

selecting a selected accumulation register from a plurality of accumulation 
registers based on identity of the selected thread, wherein each accumulation register 
of the plurality of accumulation registers corresponds to one of the plurality of 
threads; 

storing the first result in the selected accumulation register to produce a first 
accumulated value; 

receiving a second set of operands corresponding to the selected thread, 
wherein the second set of operands corresponds to a second accumulation operation 
for the selected thread; 

combining the second set of operands using a multiplication operation to 
produce a second result; 

combining the second result with the first accumulated value to produce a 
second accumulated value; [[and]] 

storing the second accumulated value in the selected register to produce a 
second accumulated result; [[and]] 

performing combination operations corresponding to at least one additional 
thread of the plurality of threads subsequent to combining the first set of operands and 
prior to combining the second set of operands [[,]] ; and 

wherein combining the second result with the first accumulated value further 
comprises combining the second result with the first accumulated value using an 
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addition operation such that a multiply and accumulate operation for the first and 
second sets of operands is achieved. 

24. (Currently amended) The method of claim [[12]]23 further comprises: 
receiving subsequent sets of operands corresponding to the selected thread 
corresponding to subsequent accumulation operations for the selected thread; 
for each subsequent set of operands: 

combining the subsequent set of operands to produce a subsequent result; 

combining the subsequent result with a current value stored in the selected 
accumulation register to produce a subsequent accumulated result; and 

storing the subsequent accumulated result in the selected accumulation register such 
that the current value stored in the selected accumulation register is updated. 
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